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Abstract: We consider a variety of nearest-neighbor spin models defined on the d-dimensional 
hypercubic lattice Z'*. Our essential assumption is that these models satisfy the condition of reflec- 
tion positivity. We prove that whenever the associated mean-field theory predicts a discontinuous 
transition, the actual model also undergoes a discontinuous transition (which occurs near the mean- 
field transition temperature), provided the dimension is sufficiently large or the first-order transition 
in the mean-field model is sufficiently strong. As an application of our general theory, we show 
that for d sufficiently large, the 3-state Potts ferromagnet on Z'^ undergoes a first-order phase tran- 
sition as the temperature varies. Similar results are established for all g-state Potts models with 
g > 3, the r-component cubic models with r > 4 and the 0(A'^)-nematic liquid-crystal models 
with iV > 3. 
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1 . Introduction 

1.1 Motivation and outline. 

Mean-field theory has traditionally played a seminal role for quahtative understanding of phase 

transitions. In fact, most practical studies of complex physical systems begin (and sometimes 
end) with the analysis of the corresponding mean field theory. The central idea of mean-field 
theory — dating back to [15, 53] — ^is rather compelUng: The ostensibly complicated interactions 
acting on a particular element of the system are replaced by the action of an effective (or mean) 
external field. This field causes a response at the point of question and its value has to be self- 
consistently adjusted so that the response matches the effective field. The practical outcome 
of this procedure is a set of equations, known as the mean-field equations. In contrast to the 
original, fully interacting system, the mean-field equations are susceptible to direct analytical 
or numerical methods. 

There is a general consensus that mean-field predictions are qualitatively or even quantitatively 
accurate. However, for short-range systems, a mathematical foundation of this beUef has not been 
presented in a general context. A number of rigorous results have related various lattice systems to 
their mean-field counterparts, either in the form of bounds on transition temperatures and critical 
exponents, see [19, 20, 52] and references therein, or in terms of limits of the free energy [48] 
and the magnetization [12, 41] as the dimension tends to infinity. In all of these results, the 
nature of the phase transition is not addressed or the proofs require special symmetries which, 
as it turns out, ensure that the transition is continuous. But, without special symmetries (or 
fine tuning) phase transitions are typically discontinuous, so generic short-range systems have 
heretofore proved elusive. (By contrast, substantial progress along these Unes has been made 
for systems where the range of the interaction plays the role of a large parameter. See, e.g., 
[10,11,14,47].) 

In this paper we demonstrate that for a certain class of nearest-neighbor spin systems, namely 
those that are reflection positive, mean-field theory indeed provides a rigorous guideline for the or- 
der of the transition. In particular, we show that the actual systems undergo a first-order transition 
whenever the associated mean-field model predicts this behavior, provided the spatial dimension 
is sufficiently high and/or the phase transition is sufficiently strong. Furthermore, we give esti- 
mates on the difference between the values of parameters of the actual model and its mean-field 
counterpart at their corresponding transitions and show that these differences tend to zero as the 
spatial dimension tends to infinity. In short, mean field theory is quantitatively accurate whenever 
the dimension is sufficiently large. 

The main driving force of our proofs is the availability of the so called infrared bound [18, 
22-24], which we use for estimating the correlations between nearest-neighbor spins. It is worth 
mentioning that the infrared bound is the principal focus of interest in a class of rigorous results 
on mean-field critical behavior of various combinatorial models [13, 30-32, 37, 39] and percola- 
tion [29,33-36,38,40] based on the technique of the lace expansion. However, in contrast to these 
results (and to the hard work that they require), our approach is more reminiscent of the earlier 
works on high-dimensional systems [1-3], where the infrared bound is provided as an input. In 
particular, for our systems this input is a consequence of reflection positivity. (As such, some 
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of our results can also be extended to systems with long-range forces; the relevant modifications 
will appear in a separate pubhcation [9].) 

The principal substance of this paper is organized as follows: We devote the remainder of 
Section 1 to a precise formulation of the general class of spin systems that we consider, we then 
develop some general mean-field formalism and, finally, state our main theorems. Section 2 con- 
tains a discussion of three eminent models — Potts, cubic and nematic — with specific statements 
of theorems which underscore the first-order (and mean-field) nature of the phase transitions for 
the large-d version of these models. In Section 3 we develop and utilize the principal tools needed 
in this work and provide proofs of all statements made in Section 1. In Section 4, we perform 
detailed analyses and collect various known results on the mean-field theories for the specific 
models mentioned above. When these systems are "sufficiently prepared," we apply the Main 
Theorem to prove all of the results stated in Section 2. Finally, in Section 5, we show that for any 
model in the class considered, the mean-field theory can be realized by defining the problem on 
the complete graph. 

1.2 Models of interest. 

Throughout this paper, we will consider the following class of spin systems on the d-dimensional 
hypercubic lattice 1^^: The spins, denoted by Sx, take values in some fixed set which is a 
subset of a finite dimensional vector space Eq. We will use (• , •) to denote the (positive-definite) 
inner product in Eq and assume that $7 is compact in the topology induced by this inner product. 
The spins are weighted according to an a priori Borel probability measure [i whose support is VL. 
An assignment of a spin value to each site x G Z'^ defines a spin configuration; we assume 
that the a priori joint distribution of all spins on Z*^ is i.i.d. Abusing the notation slightly, we 
will use [i to denote the joint a priori measure on spin configurations and use (— )o to denote the 
expectation with respect to p,. 

The interaction between the spins is described by the (formal) Hamiltonian 



Here (x, y) denotes a nearest-neighbor pair of TL"^, the quantity b, playing the role of an exter- 
nal field, is a vector from Ef^ and /3, the inverse temperature, has been incorporated into the 
(normalized) coupling constant J > and the field parameter b. 

The interaction Hamiltonian gives rise to the concept of a Gibbs measure which is defined as 
follows: Given a finite set A C Z*^, a configuration S = (S^x)xeA ir* ^ a boundary condition 
S' = (S'^)j.g2d\A in Z"^ \ A, we let (3Ha{S\S') be given by (1.1) with the first sum on the right- 
hand side of (1.1) restricted to (x, y) such that {x, yjCiA / 0, the second sum restricted to x G A, 

( S') 

and Sx for X ^ A replaced by Sx- Then we define the measure v)^ ' on configurations /S in A 
by the expression 




(1.1) 



X 



uf\dS) 



^-f3HA{S\S') 



p{dS) 



(1.2) 



Za{S') 
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where Z\{S') is the appropriate normalization constant which is called the partition function. 
The measure in (1.2) is the finite-volume Gibbs measure corresponding to the interaction (1.1). 

In statistical mechanics, the measure (1.2) describes the thermodynamic equilibrium of the spin 
system in A. To address the question of phase transitions, we have to study the possible limits 
of these measures as A expands to fill in ifi. In accord with the standard definitions, see [26], 
we say that the spin model undergoes & first-order phase transition at parameter values ( J, &) if 
there are at least two distinct infinite- volume limits of the measure in (1.2) arising from different 
boundary conditions. We will call these limiting objects either infinite-volume Gibbs measures 
or, in accordance with mathematical-physics nomenclature, Gibbs states. We refer the reader 
to [26, 52] for more details on the general properties of Gibbs states and phase transitions. 

We remark that, while the entire class of models has been written so as to appear identical, the 
physics will be quite different depending on the particulars of Q. and jjL, and the inner product. 
Indeed, the language of magnetic systems has been adapted only for linguistic and notational 
convenience. The above framework can easily accommodate any number of other physically 
motivated interacting models such as lattice gases, ferroelectrics, etc. 

1.3 Mean-field formalism. 

Here we will develop the general formalism needed for stating the principal mean-field bounds. 
The first object of interest is the logarithmic moment generating function of the distribution jj,, 

G{h) = log f ii{dS)e^^'^\ (1.3) 

Jo. 

Since Q. was assumed compact, G{h) is finite for all h G E^. Moreover, h ^ G{h) is continuous 
and convex throughout Eq. 

Every mean-field theory relies on a finite number of thermodynamic functions of internal re- 
sponses. For the systems with interaction (1.1), the object of principal interest is the magnetiza- 
tion. In general, magnetization is a quantity taking values in the closed, convex hull of O, here 
denoted by Conv(n). If m G Conv(n), then the mean-field entropy fiinction is defined via a 
Legendre transform of G{h), 

S{m) = inf {G{h) - {m,h)}. (1.4) 

(Strictly speaking, (1.4) makes sense even for m ^ Conv(fi) for which we simply get S{m) = 
— oo.) In general, m i— > S{m) is concave and we have S{m) < for all m G Conv(J7). From 
the perspective of the large-deviation theory (see [16, 19]), the mean-field entropy function is (the 
negative of) the rate function for the probability that the average of many spins is near m. 

To characterize the effect of the interaction, we have to introduce energy into the game. For 
the quadratic Hamiltonian in (1.1), the (mean-field) energy function is given simply by 

Ej,b{m) = ~J\mf - (m, b), (1.5) 

where |mp = (m, m). On the basis of physical considerations, a state of thermodynamic equi- 
hbrium corresponds to a balance between the energy and the entropy. The appropriate thermody- 
namic function characterizing this balance is the free energy. We therefore define the mean-field 
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free-energy function by setting ^ j,b{f^) = ^J^bifn) — S{m), i.e., 




(1.6) 



The mean-field (Gibbs) free energy Fmf(^, b) is defined by minimizing <Pj5(m) over all m G 
Conv(r2). Assuming a unique minimizer, this and (1.4-1.5) give us a definition of the mean-field 
magnetization, entropy and energy. A more interesting situation occurs when there is more than 
one minimizer of <Pj The latter cases are identified as the points of phase coexistence while the 
former situation is identified as the uniqueness region. 

For the sake of completeness, it is interesting to observe that every minimizer of ^j h{m) (in 
fact, every stationary point) in the relative interior of Conv(ri) is a solution of the equation 



where V denotes the (canonical) gradient in Ej^. This is the mean-field equation for the magne- 
tization, which describes the self-consistency constraint that we alluded to in Section 1.1. The 
relation between (1.7) and the stationarity of <P j^b is seen as follows: V<?j,5(m) = implies that 
Jm + 6 + VS{m) = 0. But h = — VS'(m) is equivalent to m = VG(/i), and stationarity 
therefore implies (1.7). 

We conclude with a claim that an immediate connection of the above formalism to some 
statistical mechanics problem is possible. Indeed, if the Hamiltonian (1.1) is redefined for the 
complete graph on N vertices, then the quantity ^j b{m) emerges as the rate function in a large- 
deviation principle for magnetization and hence FmfI J, b) is the free energy in this model. A 
precise statement and a proof will appear in the last section (Theorem 5.1 in Section 5); special 
cases of this result have been known since time immemorable, see e.g. [19]. 

1.4 Main results. 

Now we are in a position to state our general results. The basic idea is simply to watch what 
happens when the value of the magnetization in an actual system (governed by (1.1)) is inserted 
into the associated mean-field free-energy function. We begin with a general bound which relies 
only on convexity: 

Theorem 1.1 Consider the spin system on 1^^ with the Hamiltonian (1.1) and let vj^i, be an 

infinite-volume Gibbs measure corresponding to the parameters J > and b G Eq in (1.1). 
Suppose that vj ^ is invariant under the group of translations and rotations ofU^. Let {—)j,b 
denote the expectation with respect to vj b and let be the magnetization of the state vj^t, 
defined by 



m = VG{Jm-\-b), 



(1.7) 



(1.8) 



where denotes the origin in Z'^. Then 




(1.9) 



where x denotes a nearest neighbor of the origin. 
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Thus, whenever the fluctuations of nearest-neighbor spins have small correlations, the physical 
magnetization almost minimizes the mean-field free energy. The bound (1.9) immediately leads 
to the following observation, which, to the best of our knowledge, does not appear in the Uterature: 

Corollary 1.2 Let z^j f, and (— ) j,f, be as in Theorem 1.1 and let be as in (1.8). Then 

{{S^,Sy))j,^>\m^\'' (1.10) 

for any pair of nearest-neighbors x,y & Z*^. In particular, for any model with interaction (1.1), 
the nearest-neighbor spins are positively correlated in any Gibbs state which is invariant under 
the translations and rotations ofL^. 

Our next goal is to characterize a class of Gibbs states for which the correlation term on the 
right-hand side of (1.9) is demonstrably small. However, our proofs will make some minimal 
demands on the Gibbs states themselves and it is therefore conceivable that we may not be able 
to access all the extremal magnetizations. To define those values of magnetization for which 
our proofs hold, let F( J, 6) denote the infinite-volume free energy per site of the system on U^, 
defined by taking the thermodynamic limit of — ^ log Za, see e.g. [50]. (Note that the existence 
of this limit follows automatically by the compactness of O.) The function F( J, 6) is concave 
and, therefore, has all directional derivatives. Let ^( J, 6) be the set of all pairs [e*, m*] such 
that 

F(J + AJ, 6 + A6) - F(J, h) < e^AJ + (m^, Ab) (1.11) 

holds for all numbers A J and all vectors A& € E^- By a well-known result (see the discussion of 
the properties of subdifferential on page 215 of [51]), J, b) is a convex set; we let J, b) 
denote the set of all values such that [e*, m*] is an extreme point of the set J^{J, b) for some 
value e*. 

Our Main Theorem is then as follows: 

IVIaiii Theorem. Let d > 3 and consider the spin system on 'L^ with the Hamiltonian (1.1). 
Let n denote the dimension of Kq. For J > and b G Eq, let rrii, € .^*( J, b). Then 

^J,b{m^)< inf^ ^j,b{m) + Jn'^h, (1.12) 

■mGConv(n) 2 



where k = m.axs£n{S , S) and 

^ (27r)'i D{k) 



7[-7r,7rl' 



with D{k) = l-\ Ej=l C08{ky). 

The bound (1.12) provides us with a powerful method for proving first-order phase transitions 
on the basis of a comparison with the associated mean-field theory. The key to our whole program 
is that the "error term", Jn^I^, vanishes in the d — >^ oo limit; in fact, 

h=^{l + o{l)) as d^oo, (1.14) 
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Figure 1 . The mean-field free energy as a function of a scalar magnetization m( J) for the 
typical model undergoing a first-order phase transition. In an interval of values of J, there are 
two local minima which switch their order at J = Jmf- If the "barrier" height A(J) always 
exceeds the error term from (1.12), there is a forbidden interval of scalar magnetizations and 
m( J) has to jump as J varies. The actual plot corresponds to the 3-state Potts model for J 
taking the values (a) 2.73, (b) 2.76, (c) 2.77 and (d) 2.8. See Section 2.1 for more details. 

see [12]. For d sufficiently large, the bound (1.12) thus forces the magnetization of the actual 
system to be near a value of m that nearly minimizes ^j^b{m). Now, recall a typical situation 
of the mean-field theory with a first-order phase transition: There is a Jmf such that, for J near 
Jmf, the mean-field free-energy function has two nearly degenerate minima separated by a bar- 
rier of height A( J), see Figure 1. If the barrier A( J) always exceeds the error term in (1.12), 
i.e., if A(J) > Jn^Id, some intermediate values of magnetization are forbidden and, as J in- 
creases through Jmf, the physical magnetization undergoes a jump at some J near Jmf- See also 
Figure 2. 

The Main Theorem is a direct consequence of Theorem 1 . 1 and the following lemma: 

Key Estimate Let J > and b € Kq and let € ^vc( J, b). Let n, k and be as in the 
Main Theorem. Then there is an infinite-volume Gibbs state vj^bfor interaction (1.1) such that 

m^ = {So)j,b (1.15) 

and 

{{S^, Sy))j^^ - |m^p < uKld, (1.16) 
for any nearest-neighbor pair x,y £ Z'^. Here ( — ) j,5 denotes the expectation with respect to vj^b- 
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The Key Estimate follows readily under certain conditions; for instance, when the parameter 
values J and 6 are such that there is a unique Gibbs state. Under these circumstances, the bound 
(1.16) is a special case of the infrared bound which can be derived using reflection positivity 
(see [18,22-24]) and paying close attention to the "zero mode." Unfortunately, at the points of 
non-uniqueness, the bound in (1.16) is also needed. The restriction to extreme magnetizations is 
thus dictated by the need to approximate the magnetizations (and the states which exhibit them) 
by states where the standard "RP, IRB" technology can be employed. 

The Key Estimate and Theorem 1.1 constitute a proof of the Main Theorem. Thus, a first- 
order phase transition (for d':^ V) can be established in any system of the form (1.1) by detailed 
analysis of the full mean-field theory. Although this sounds easy in principle, in practice there are 
cases where this can be quite a challenge. But, ultimately, the Main Theorem reduces the proof 
of a phase transitions to a problem in advanced calculus where (if desperate) one can employ 
computers to assist in the analysis. 

1.5 Direct argument for mean-field equation. 

We have stated our main results in the context of the mean-field free energy. However, many 
practical calculations focus immediately on the mean-field equation for magnetization (1.7). As 
it turns out, a direct study of the mean-field equation provides us with an alternative (albeit exis- 
tential) approach to the results of this paper. The core of this approach is the variance bound for 
the magnetization stated as follows: 

Lemma 1.3 Let d > 2> and consider the spin system on TH^ with the Hamiltonian (1.1). Let n 
and Ifi be as in the Main Theorem. For J > and b € Eq, let rrii, G J, b). Then there is an 
infinite-volume Gibbs state vj^bfar the interaction (1.1) such that rrii, = (<So) J,b '^nd 

E S:c-mJ) <nJ-^Id, (1.17) 
2d I j,b 

x: \x\=l 



where (— ) j,6 denotes the expectation with respect to uj ^. 

Here is how the bound (1.17) can be used to prove that mean-field equations are accurate in 
sufficiently large dimensions: Conditioning on the spin values at the neighbors of the origin and 
recalling the definition of G{h), the expectation {So)j,b can be written as 

J 



{So)j,b={VG(- Yl + (1-18) 



x: 



\x\=l ^ ' 

Since the right-hand side of (1.17) tends to zero as d ^ oo, the (spatial) average of the spins 
neighboring the origin — namely ^ Ylix- \x\=i — probabihty, very close to m*. 
Using this in (1.18), we thus find that approximately satisfies the mean-field equation (1.7). 
Thus, to demonstrate phase coexistence (for d » 1) it is sufficient to show that, along some 
curve in the parameter space, the solutions to the mean-field equations cannot be assembled into 
a continuous function. In many cases, this can be done dramatically by perturbative arguments. 

While this alternative approach has practical appeal for certain systems, the principal drawback 
is that it provides no clue as to the location of the transition temperature. Indeed, as mentioned 
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Figure 2. The solutions of the mean-field equation for the scalar order parameter m as 
a function of J for the 10-state Potts model. The solid lines indicate the local minima, the 
dashed lines show the other solutions to the mean-field equation. The portions of these curves 
in the regions where m is sufficiently close to zero or one can be (rigorously) controlled 
using perturbative calculations. These alone prove that the mean-field theory "does not admit 
continuous solutions" and, therefore, establish a first order transitions for d » 1. The shaded 
regions show the set of allowed magnetizations for the system on Z'^ when < 0.002. 
In addition to manifestly proving a discontinuous transition, these provide tight numerical 
bounds on the transition temperature and reasonable bounds on the size of the jump. 

in the paragraph following the Main Theorem, secondary minima and other irrelevant solutions 
to the mean-field equations typically develop well below J = Jmf- Without the guidance of the 
free energy, there is no way of knowing which solutions are physically relevant. 



In this section we adapt the previous general statements to three models: the g-state Potts model, 
the r-component cubic model and the 0(A^)-nematic liquid crystal model. For appropriate ranges 
of the parameters q, r and N and dimension sufficiently large, we show that these models undergo 
a first-order phase transition as J varies. The relevant results appear as Theorems 2.1, 2.3 and 2.6. 

2.1 Potts model. 

The Potts model, introduced in [49], is usually described as having a discrete spin space with q 
states, ax & {1,2, ... ,q}, with the (formal) Hamiltonian 



Here S^^cry is the usual Kronecker delta and 3 = To bring the interaction into the form of (1.1), 
we use the so called tetrahedral representation, see [54]. In particular, we let = {vi, . . . , v^}. 



2. Results for specific models 




(2.1) 
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where denote the vertices of a — l)-dimensional hypertetrahedron, i.e., G with 

{1, if a = /3, 

1 (2.2) 
— ^ , otherwise. 
9-1' 

The inner product is proportional to the usual dot product in Exphcitly, if Sx & ^ corre- 

sponds to ax G {!,...,(/}, then we have 

q-l 1 

i^xj ^y) = Sx ■ Sy = 5(j u . (2.3) 

q " q 

(The reason for this rescaUng the dot product is to maintain coherence with existing treatments 
of the mean-field version of this model.) The a priori measure /x gives a uniform weight to 
all q states in Q,. 

Let us summarize some of the existing rigorous results about the g-state Potts model. The 
q = 2 model is the Ising model, which in mean-field theory as well as real life has a continuous 
transition. It is believed that the Potts model has a discontinuous transition for all d > 3 and 
q > 3 (see, e.g., [54]). In any d > 2, it was first proved in [45] that for q sufficiently large, 
the energy density has a region of forbidden values over which it must jump discontinuously 
as J increases. On the basis of FKG monotonicity properties, see [4], this easily impUes that the 
magnetization is also discontinuous. Such results have been refined and improved; for instance 
in [44,46], Pirogov-Sinai type expansions have been used to show that there is a single point of 
discontinuity outside of which all quantities are analytic. However, for c? > 3, the values of q for 
which these techniques work are "astronomical," and, moreover, deteriorate exponentially with 
increasing dimension. 

Let m^{J) and e*(J) denote the the actual magnetization and energy density, respectively. 
These quantities can be defined using one-sided derivatives of the physical free energy: 



d 



d 



&=o+ ' ' dJ' 



or, equivalently, by optimizing the expectations ((vi, Sq)), resp., ^{{Sq, Sx)), where "0" is the 
origin and x is its nearest neighbor, over all Gibbs states that are invariant under the symmetries 
of Z*^. Recalling the Fortuin-Kasteleyn representation [4,21,27,28], let Poo{J) be the probability 
that, in the associated random cluster model with parameters p = 1 — e"'^/^^'^^ and q, the origin 
lies in an infinite cluster. Then mi,{J) and Poo{J) are related by the equation 

m*(J) = ^P^{J). (2.5) 

q 

As a consequence, the magnetization ■m^{J) is a non-decreasing and right-continuous function 
of J. The energy density e*(J) is non-decreasing in J simply by concavity of the free energy. 
The availability of the graphical representation allows us to make general statements about the 
phase-structure of these systems. In particular, in any d > 2 and for all q under consideration, 
there is a Jc = Jc{q, d) G (0, oo) such that mi,{J) > for J > Jc while mi,{J) = for J < J^, 
see [4,28]. Whenever mi,{Jc) > (which, by the aforementioned results [44^6], is known for 
q » 1), there are at least q + 1 distinct extremal, translation-invariant Gibbs states at J = Jc. 
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The mean-field free energy for the model without external field is best written in terms of 
components of m: If (xi, . . . , Xg) is a probabiUty vector, we express m as 

m = XiVi H h XqVq. (2.6) 

The interpretation of this relation is immediate: corresponds to the proportion of spins in the 
k-th spin-state. In terms of the variables in (2.6), the mean-field free-energy function is (to within 
a constant) given by 

g 

0j{m) = J2{-i4 + Xk'^ogXk). (2.7) 

k=i 

In (2.7) we have for once and all set the external field b to zero and suppressed it from the notation. 

It is well-known (see [41, 54] and also Lemma 4.4 of the present paper) that, for each q > 3, 
there is a Jmf £ (2, q) such that <Pj has a unique global minimizer m = for J < Jmf, 
while for J > Jmf, there are q global minimizers which are obtained by permutations of single 
(xi, . . . , Xq) with xi > X2 = ■ ■ ■ = Xq. To kccp the correspondence with mi,{J), we define the 
scalar mean-field magnetization mMF( J) as the maximal Euclidean norm of all global minimizers 
of the mean-field free energy 0j{m). (In this parametrization, the asymmetric global maxima 
will be given by xi = ^ + mMF( J) and X2 = ■ ■ ■ = Xq = ^ — ^^mMF( J)-) Then mMF( J) is 
the maximal positive solution to the equation 

J —2-- rn 1 
n e — 1 

^ = • (2-8) 



1 J ^-rp III . 1 

e + q-l 

In particular, J i-^ mMv{J) is non-decreasing. We note that the explicit values of the coupling 
constant Jmf and the magnetization rnc = TnMF( Jmf) at the mean-field transition are known: 

Jmf = 2 ^ log(g - 1) and mc = (2.9) 
q-2 q 

see e.g. [54]. Thus, the mean-field transition is first-order for all q> 2. 

Our main result about the Potts model is then as follows: 

Theorem 2.1 (Potts model) Consider the q-state Potts model on Z'^ and let m^i,( J) be its scalar 
magnetization. For each q > 3, there exists a Jt = Jt(g, d) and two numbers ei = ei(d, J) > 
and €2 = 62 (c?) > satisfying ei{d, J) — > 0, uniformly on finite intervals of J, and e2{d) as 
d ^ oo, such that the following holds: 

m^{J)<ei for J < (2.10) 

and 

|m^(J)-mMF(J)| <ei for J > J,. (2.11) 

Moreover, 

|Jt-JMF|<e2- (2.12) 

In particular, both the magnetization mi,{J) and the energy density e*(J) undergo a jump at 
J = Jt whenever d is sufficiently large. 
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The jump in the energy density at Jt immediately implies the existence of at least q + 1 distinct 
extremal Gibbs measures at J = Jt. However, the nature of our proofs does not permit us to 
conclude that m^(J) = for J < Jj nor can we rule out that J) undergoes further jumps 
for J > Jf (Nonetheless, the jumps for J > Ji would have to be smaller than 2ei{d).) Un- 
fortunately, we can say nothing about the continuous-g variant of the Potts model — the random 
cluster model — for non-integer q. In this work, the proofs lean too heavily on the spin repre- 
sentation. Furthermore, for non-integer q, the use of our principal tool, reflection positivity, is 
forbidden; see [8]. 

We also concede that, despite physical intuition to the contrary, our best bounds on 62 (d) 
and ei {d, J) deteriorate with increasing q. This is an artifact of the occurrence of the single-spin 
space dimension on the right-hand side of (1.12). (This sort of thing seems to plague all exist- 
ing estimates based on reflection positivity.) In particular, we cannot yet produce a sufficiently 
large dimension d for which the phase transition in all (q > 3)-state Potts models would be 
provably first order. 

2.2 Cubic model. 

Our second example of interest is the r-component cubic model. Here the spins Sx are the unit 
vectors in the coordinate directions of M'', i.e., if e^ are the standard unit vectors in W, then 

n = {±ek: k = l,...,r}. (2.13) 

The Hamiltonian is given by (1.1), with the inner product given by the usual dot product in W 
and the a priori measure given by the uniform measure on $7. As in the last subsection, we set 
6 = and suppress any 6-dependence from the notation. We note that the r = 1 case is the Ising 
model while the case r = 2 is equivalent to two uncoupled Ising models. 

The cubic model was introduced (and studied) in [42, 43] as a model of the magnetism in 
rare-earth compounds with a cubic crystal symmetry. There it was noted that the associated 
mean-field theory has a discontinuous transition for r > 4, while the transition is continuous for 
r = 1, 2 and 3. The mean field theory is best expressed in terms of the collection of parameters 
y = {yi, . . . ,yr) and p. = (/Ui, . . . , /v,,), where y^ stands for the fraction of spins that take the 
values zbcfc and pkUk is the magnetization in the direction e^. In this language, the magnetization 
vector can be written as 

m = j/i/xiei H h yriJ-r^r- (2.14) 

To describe the mean-field free-energy function, we define 

r 

{y, A) = XI (y^ + &2jyiiJ.k)) , (2. 15) 

fe=i 

where ©j{ij) denotes the standard Ising mean-field free energy with bias fi; i.e., the quantity 
in (2.7) with q = 2, xi = ^(1 + ji) and X2 = \{1 — p). Then <Pj(m) is found by minimizing 

K^j \y, jl) over all allowed pairs {y, fl) such that (2.14) holds. 

As in the case of the Potts model, the global minimizer of 0j{m) will be a permutation of a 
highly-symmetric state. However, this time the result is not so well known, so we state it as a 
separate proposition: 
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Proposition 2.2 Consider the r-component cubic model. For each J > 0, the only local minima 
of are m = or m = zbmMpefc, k = 1, . . . ,r, where uimf = '>tt-mf{J) is the maximal 
positive solution to the equation 

sinh Jm 
r — 1 + cosh Jm 

Furthermore, there is a Jmf £ (0, oo) such that the only global minimizers of(pj{m) are m = 
for J < Jmf and m = ±mMF{J)^k, k = 1, . . . ,r, (with niuFiJ) > O)for J > Jmf- 

For a system on Z'^, the scalar magnetization is most conveniently defined as the norm of 
(So) J, optimized over all translation-invariant Gibbs states for the couphng constant J. The 
energy density e*( J) is defined using the same formula as for the Potts model, see (2.4). 

Our main result about the cubic model is then as follows: 

Theorem 2.3 (Cubic model) Consider the r-state cubic model on Z'^ and let m*(J) be its 

scalar magnetization. Then for every r > A, there exists a = Jt{q,d) and two numbers 
ei = ei{d, J) > and €2 = ^2{d) > satisfying €i{d, J) 0, uniformly on finite intervals of J, 
and 62 (d) — > aj d — > 00, such that the following holds: 

TO*(J) < ei for J<Ji (2.17) 

and 

\m^{J) - mMF{J)\ < ei for J > J,. (2.18) 

Moreover, 

|J-JMF|<e2. (2.19) 

In particular, both the magnetization mi,{J) and the energy density e^(J) undergo a jump at 
J = Jt whenever d is sufficiently large. 

As in the case of the Potts model, our technique does not allow us to conclude that Jt is the only 
value of J where the magnetization undergoes a jump. In this case, we do not even know that the 
magnetization is a monotone function of J; the conclusions (2.17-2.18) can be made because we 
know that the energy density is close to ^m*(J)^ and is (as always) a non-decreasing function 
of J. Finally, we also cannot prove that, in the state with large magnetization in the direction ei, 
there will be no additional symmetry breaking in the other directions. Further analysis, based 
perhaps on graphical representations, is needed. 

2.3 Nematic liquid-crystal model. 

The nematic models are designed to study the behavior of hquid crystals, see the monograph [25] 
for more background on the subject. In the simplest cases, a hquid crystal may be regarded as a 
suspension of rod-like molecules which, for all intents and purposes, are symmetric around their 
midpoint. For the models of direct physical relevance, each rod (or a small collection of rods) 
is described by an three-dimensional spin and one considers only interactions that are (globally) 
0(3)-invariant and invariant under the (local) reversal of any spin. The simplest latticized version 



14 



MAREK BISKUP AND LINCOLN CHAYES 



of such a system is described by the Hamiltonian 

PH{s) = -^Y.(^x-Syf, (2.20) 

with Sx a unit vector in and x G Z*^ with d = 2 or d = 3. We wiU study the above Hamiltonian, 
but we will consider general dimensions d (provided d > 3) and spins that are unit vectors in 
any (provided N > 3). 

The Hamiltonian (2.20) can be rewritten into the form (1.1) as follows [25]: Let ¥,fi be the 
space of all traceless N x N matrices with real coefficients and let $7 be the set of those matrices 
Q = {Qa,i3) S for which there is a unit vector inv = (va) G M-^ such that 

QaP = VcxVp - j^Sap, a, (5 = I, . . . , N . (2.21) 
Writing for the matrix arising from the spin via (2.21), the interaction term becomes 

{s^.Syf = Tv{Q,Qy) + ^^. (2.22) 

Now En is a finite-dimensional vector space and (Q, Q') = Tv{QQ') is an inner product on En, 
so (2.20) indeed takes the desired form (1.1), up to a constant that has no relevance for physics. 

The a priori measure on J) is a pull-back of the uniform distribution on the unit sphere in R^. 
More precisely, if v is uniformly distributed on the unit sphere in M^, then Q G is a random 
variable arising from v via (2.21). As a consequence, the a priori distribution is invariant under 
the action of the Lee group 0{N, R) given by 

Q.^g-^Q.g, geO{N,R). (2.23) 

The parameter signaling the phase transition, the so called order parameter, is "tensor" valued. 
In particular, it corresponds to the expectation of Qq. The order parameter can always be diago- 
nalized. The diagonal form is not unique; however, we can find an orthogonal transformation that 
puts the eigenvalues in a decreasing order. Thus the order parameter is effectively an A^-vector 
A = (Ai, . . . , Aat) such that Ai > A2 > • • • > \n- We note that, since each is traceless, 
we have J2k = 0- 

The previous discussion suggests the following definition of the scalar order parameter: For 
J > 0, we let A* (J) be the value of the largest non-negative eigenvalue of the matrix {Qq)j, opti- 
mized over all translation-invariant Gibbs states for the coupling constant J. As far as rigorous re- 
sults about the quantity \i,{J) are concerned, we know from [6] that (in d > 3) A^,( J) > once J 
is sufficiently large. On the other hand, standard high-temperature techniques (see e.g. [5,7, 17]) 
show that if J is sufficiently small then there is a unique Gibbs state. In particular, since this 
state is then invariant under the action (2.23) of the full 0(A'^, R) group, this necessitates that 
A*( J) = for J small enough. The goal of this section is to show that A* (J) actually undergoes 
a jump as J varies. 

The mean-field theory of the nematic model is formidable. Indeed, for any particular N it 
does not seem possible to obtain a workable expression for #j(A), even if we allow that the 
components of A have only two distinct values (which is usually assumed without apology in the 
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physics literature). Notwithstanding, this simple form of the vector minimizer and at least some 
of the anticipated properties can be estabUshed: 

Proposition 2.4 Consider the 0{N)-nematic model for N > 3. Then every local minimum 
of^j{\) is an orthogonal transformation of the matrix A = diag(A, ■ ■ ■ , —j^ri)> where 

A is a non-negative solution to the equation 



rdx(l-x^)Ve^-^(x^-^) 
Jo 



A = ^ . (2.24) 



I 



/o 

In particular, there is an increasing and right-continuous function J i— > Amf(</) such that the 
unique minimizer of <Pj{X) is A = Ofor J < Jmf, while for any J > Jmp, the function #j(A) is 
minimized by the orthogonal transformations of 

A = diag(^AMF(J), - ^ (2-25) 

At the continuity points of Xyiv ■ {Jmf, oo) [0, 1], these are the only global minimizers of<P j. 

Based on the pictorial solution of the problem by physicists, see e.g. [25], we would expect that 
J 1-^ Amf(«/) is continuous on its domain and, in fact, corresponds to the maximal positive solu- 
tion to (2.24). (This boils down to showing certain convexity-concavity property of the function 
on the right-hand side of (2.24).) While we could not establish this fact for all iV > 3, we were 
successful at least for N sufficiently large. The results of the large- analysis are summarized 
as follows: 

Proposition 2.5 Consider the 0{N)-nematic model for N > 3 and let (J) be the maximal 
positive solution to (2.24). Then there exists an Nq > 3 and, for each N > Nq, a number 
Jmf = Jmf{]^) & (0, oo) such that for each N > Nq, the unique minimizer of ^ is \ = 
for J < Jmf, while for any J > Jmf, the function ^j(A) is minimized only by the orthogonal 
transformations of (2.25), with Amf( J) > 0. 

The function J ^ ^mf {'^) continuous and strictly increasing on its domain and has the 
following large-N asymptotic: For all J > 2, 

lim >^ (JN) = J (1 + Vl - 4J-2) . (2.26) 
Moreover, there exists a (with J^^^ 2.455j such that 

lim:M^ = j(-). (2.27) 

N^oo iV 

Now we are ready to state our main theorem concerning 0(A^)-nematics. As can be gleaned 
from a careful reading, our conclusions are not quite as strong as in the previous cases (due the 
intractability of the associated mean-field theory). Nevertheless, a bona fide first-order transition 
is established for these systems. 
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Theorem 2.6 (Nematic model) Consider the 0{N)-nematic model with the Hamiltonian (2.20) 
and J > 0. For each N > S, there exists a non-negative function J i— > X^{J), a constant 

Ji = Jt{N, d) and two numbers ei = ei(d, J) > and €2 = €2{d) > satisfying €i{d, J) — >■ 0, 
uniformly on finite intervals of J, and e2{d) ^ as d ^ 00, such that the following holds: 

For all J > 0, the matrix A = diag(AJ^p( J), — ^^^rp, • • • , — -^rrp) ^ local minimum of<Pj. 
Moreover, we have the bounds 

A* (J) < ei for J <Jt (2.28) 

and 

|A*(J)-Aj^(J)| <ei for J > J,. (2.29) 

Furthermore, 

|Jt-JMF|<e2. (2.30) 

In particular, A^( J) > x > 0/or all J > Ji and all N > S and both the order parameter and 
the energy density e*( J) undergo a jump at J = Jt, provided the dimension is sufficiently large. 

The upshot of the previous theorem is that the high-temperature region with A = and the 

low-temperature region with A / (whose existence was proved in [6]) are separated by a first- 
order transition. However, as with the other models, our techniques are not sufficient to prove 
that A is exactly zero for all J < Jt, nor, for J > Jt, that all states are devoid of some other 
additional breakdown of symmetry. Notwithstanding, general theorems about Gibbs measures 
guarantee that, a jump of J A^(J) at J = Jt implies the coexistence of a "high-temperature" 
state with various symmetry-broken "low-temperature" states. 



3. Proofs of mean-fteld bounds 

3.1 Convexity estimates. 

In order to prove Theorem 1.1, we need to recall a few standard notions from convexity theory 
and prove a simple lemma. Let =2/ C M" be a convex set. Then we define the affine hull of ,s/ by 
the formula 

affj^ = {Xx + {l-\)y:x,ye^,XeR}. (3.1) 

(Alternatively, aff jz/ is an smallest affine subset of containing jz/.) This concept allows us to 
define the relative interior, ri of jz/ as the set of all x G =2/ for which there exists an e > 
such that 

y G aff jj/ & \y - x\<€ =^ y e £/. (3.2) 

It is noted that this definition of relative interior differs from the standard topological definition. 
For us it is important that the standard (topological) closure of ri is simply the standard closure 
of jz/. We refer to [51] for more details. 

Lemma 3.1 For each m G ri {m' G : S{m') > —00}, there exists a vector h e such 
that VG{h) = m. 
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Results of this sort are quite well known; e.g., with some effort this can be gleaned from 
Lemma 2.2.12 in [16] combined with the fact that the so called exposed points of S{m) can be 
realized as VG{h) for some h. For completeness, we provide a full derivation which exploits the 
particulars of the setup at hand. 

Proof. Let abbreviate {m' G Kq : S{m') > — oo} and let m G ri*^. Let us define the set 
V = {m' — m : m' € aff ^}. It is easy to see that V is in fact the affine hull of the shifted set 
— m and, since G V, it is a closed Unear subspace of Eq. First we claim that the infimum in 
(1.4) can be restricted to h eY. Indeed, if /i, a G Eq, then the convexity of /i G{h) gives 

G{h + a)-{h + a,m)> G{h) - {h, m) + (a, VG{h) - m) (3.3) 

for any m. This impUes that VG{h) has a finite entropy, i.e., VG{h) G for any h G Eq. Now 
let m be as above and a G V^. Then an inspection of the definition of V shows that the last term 
in (3.3) identically vanishes. Consequently, for the infimum (1.4), we will always be better off 
with heW. 

Let hk G V be a minimizing sequence for <S'(m); i.e., G{hk) — {hk, m) S{m) as k ^ oo. 
We claim that hj. contains a subsequence tending to a finite limit. Indeed, if on the contrary 
hk = \hk\ oo we let be defined by hk = hkTk and suppose that Tk ^ t (at least along a 
subsequence), where \t\ = 1. Now since m G ri and r G V, we have m + er G aff for all 
e and, by (3.2), m + er G for some e > sufficiently small. But we also have 

G{hk) - {hk, m + er) = G{hk) - {hk, m) - ehk{Tk, r), (3.4) 

which tends to the negative infinity because (rfc, r) — > 1 and hk — *• oo. But then S{m + er) = 
— oo, which contradicts that m+er G Thus hk contains a converging subsequence, hk^ h. 
Using that h is an actual minimizer of G{h) — {h, m), it follows that VG{h) = m. □ 

Now we are ready to prove our principal convexity bound: 

Proof of Theorem 1.1. Recall that Fmf(</, denotes the infimum of (pj^im) over all m G 
Conv(ri). As a first step, we will prove that there is a constant C < oo such that for any finite 
A C Z*^ and any boundary condition S'gj^, the partition function obeys the bound 

ZAiS'Q^)>e-\^\^^^''''^-^\^^\, (3.5) 

where |A| denotes the number of sites in A and \dA\ denotes the number of bonds of Z'^ with one 
end in A and the other in Z*^ \ A. (This is an explicit form of the well known fact that the free 
energy is always lower than the associated mean-field free energy, see [19,52].) 
To prove (3.5), let Ma denote the total magnetization in A, 

Ma = J2Sx, (3.6) 

and let (— be the a priori state in A tilted with a uniform magnetic field h, i.e., for any 
measurable function / of the configurations in A, 

{f)(^l = e-\^\^(^\fe(^'^-^)o. (3.7) 
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Fix an h. € and let rrih = \/G{h). By inspection, VG{h) = {Sx)l^^ for all x € A. Then 

which using Jensen's inequaUty gives 

Zj^iS'g^) > exp{|A|(G(h) - {h,mh)) - {(3H{Sj,\S'9^))^^^]. (3.9) 

To estimate the expectation of PH{SA\Sgf^), we first discard (through a bound) the boundary 
terms and then evaluate the contribution of the interior bonds. Since the number of interior bonds 
in A is more than d|A| — \dA\, this gets us 

-{PH{Sa\S'9^))o^^ > ~\mh\' - C\dA\. (3.10) 

Now G{h) - ih,mh) > S{mh), so we have ^a(S^a) > e-l^l'^^.f^^")-^!^^!. But Lemma 3.1 
guarantees that each m with S{m) > — oo can be approximated by a sequence of rrih with 
h G Kq, so the bound (3.5) follows by optimizing over h G E^- 

Next, let uj ij be an infinite volume Gibbs state and let (— ) j,6 denote expectation with respect 
to z/j b. Then we claim that 

^\A\Gih) ^ ^^ih,M^)+pH^iS^\SeA)z^(SQj,))j^^. (3.11) 

(Here Sa, resp. Sqa denote the part of the same configuration S inside, resp., outside A. Note 

that the relation looks trivial for h = 0.) Indeed, the conditional distribution in z/j 5 given that the 

(S') 

configuration outside A equals S is v)^ % as defined in (1.2). But then (1.2) tells us that 

Je(f^'^^)+f^H^(S^\S')zA{S')uf\dSA) = y e('^'^^)M(d5A) = e'^I^C^). (3.12) 

The expectation over the boundary condition S' then becomes irrelevant and (3.11) is proved. 

Now suppose that uj^b is the Z'^-translation and rotation invariant Gibbs measure in question 
and recall that = {So)j^b^ where {—)j,b denotes the expectation with respect to uj^b- To prove 
our desired estimate, we use (3.5) on the right-hand side of (3.11) and apply Jensen's inequaUty 
to get 

el^l«('^) > e^p{{{h,MA) + PHA)j^f]e-\''\^-^(-''''^-''\^''\. (3.13) 
Using the invariance of the state uj^b with respect to the translations and rotations of Z*^, we have 

{{h,MA))j^^=mh,m^) (3.14) 

while 

mA)j,b > -\A\^{iSo,S,))j^,-\A\ib,m^) - C'\dA\, (3.15) 

where C is a constant that bounds the worst-case boundary term and where x stands for any 
neighbor of the origin. By plugging these bounds back into (3.13) and passing to the thermody- 
namic limit, we conclude that 

-G{h) + {h-b, - ^{{So, S^))j^^ < Fmf(J, b). (3.16) 
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Now optimizing the left-hand side over h e allows us to replace —G{h) + {h, m^) by 
—S{mi,). Then the bound (1.9) follows by adding and subtracting the term ^|m^tp on the left- 
hand side. □ 



3.2 Infrared bound. 

Our proof of the Key Estimate (and hence the Main Theorem) requires the use of the infrared 
bounds, which in turn are derived from reflection positivity. The connection between infrared 
bounds and reflection positivity dates back (at least) to [18, 22-24]. However, the present for- 
mulation (essentially already contained in [12, 24, 41]) emphasizes more expUcitly the role of 
the "k = 0" Fourier mode of the two-point correlation function by subtracting the square of the 
background average. 

Reflection positivity is greatly facilitated by first considering finite systems with periodic 
boundary conditions. If it happens that there is a unique Gibbs state for parameter values J 
and h then the proof of the Key Estimate is straightforward — there is no difficulty with putting 
the system on a torus and taking the hmit. In particular, the Key Estimate amounts (more or 
less) to Corollary 2.5 in [24]. But when there are several infinite- volume Gibbs states, we can 
anticipate trouble with the naive hmits of the finite-volume torus states. Fortunately, Gibbsian 
uniqueness is not essential to our arguments. Below we list two properties of Gibbs states which 
allow a straightforward proof of the desired infrared bound. Then we show that in general we can 
obtain the infrared bound for states of interest by an approximation argument. 

Property 1 An infinite-volume Gibbs measure vj i, (not necessarily extremal) for the interaction 
(1.1) is called a torus state if it can be obtained by a (possibly subsequential) weak limit as 
L ^ oo of the Gibbs states in volume [—L^Vf n Z*^, for the interaction (1.1) with periodic 
boundary conditions. 

Given J and h, we let ^( J, b) denote the subset of Conv(r2) containing all magnetizations 
achieved by infinite-volume translation-invariant Gibbs states for the interaction (1.1). Next, 
recall the notation M"a from (3.6) for the average magnetization in A c Z*^. 

Property 2 An infinite-volume Gibbs measure uj ij (not necessarily extremal) for the interaction 
(1.1) w said to have block-average magnetization m if 

lim f = m, u J b-almost surely . (3.17) 

AyZ'' \A\ 

Here the convergence A y Z'^ is along the net of all the finite boxes A C Z'^ with partial order 
induced by set inclusion. (See [26] for more details.) 

Our first goal is to show that every torus state with a deterministic block-average magnetization 
satisfies the infrared bound. Suppose d > 3 and let denote the Fourier transform of the 
inverse lattice Laplacian with Dirichlet boundary condition. In lattice coordinates, has the 
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representation 



J[-n,7T]'' {^T^r D{k) 

where -D(fe) = 1 — ^ X^j=i cos(fcj). Note that the integral converges by our assumption that 

d>3. 

Lemma 3.2 Let d > 3 and suppose that vj^b is a Gibbs state for interaction (1.1) satisfying 
Properties 1 and 2. Let (— ) j,6 denote the expectation with respect to vj^b cmd let m denote the 
value of magnetization in ujb- Then for all {vx)xeZ'i- ^^^^ G M and YIxgZ'' b^l < 

XI VxVy{{Sx-m,Sy-m))j^^<nJ~^ ^ v^Vy D'^ {x , y) . (3.19) 



Here n denotes the dimension o/Eq. 

Proof Let Al = [-L,L]'^ n Z'^ and let i^'f'^ be the finite-volume Gibbs state in Al for the 
interaction (1.1) with periodic boundary conditions. Let A^ = {{2L+1 

rii < L} denote the reciprocal lattice. Let {wx)xgAl t>e a collection of vectors from Eq satisfying 
that Wx ^ for only a finite number of x G Z*^ and J2xeAL ~ ^" (~) J6 denote the 
expectation with respect to Uj^jj. Then we have the infrared bound [22-24], 

{iwx,Sx){wy,Sy))^j'^l<J-^ iwx,Wy)Dl\x,y) (3.20) 

where 

D7\x,y) = -^ y -J—e'''^''-y\ (3.21) 

Now, let ei, . . . ,e„ be an orthogonal basis in Eq and choose Wx = Wx^^t, where {wx)x^id- is 
such that Wxi^Q only for a finite number of x G Z'^ and 

Y^wx = (). (3.22) 

Passing to the limit L ^ 00 in such a way that u'^j'^ converges to the state z/j^j,, and then summing 
over ^ = 1, . . . , n gets us the bound 

X WxWy{{Sx,Sy))j^^<nJ~'^ X WxWyD~^{x,y). (3.23) 



So far we have (3.23) only for {wx) with a finite support. But, using that fact that both quantities 
D~^{x,y) and ((/S^, <Sy)) are uniformly bounded, (3.23) is easily extended to all absolutely- 
summable {wx)xezd (i-e., those satisfying Ylxe'z^ l^^l ^ which obey the constraint (3.22). 
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Let {vx) be as specified in the statement of the Lemma and let a = J2xeZ'i '^x- Fix K, let h.K 
be as above and define (ifi^"* ) by 

= vx- 1 {xeAx} • (3-24) 

Clearly, these {wi^^) obey the constraint (3.22). Our goal is to recover (3.19) from (3.23) in the 
K ^ oo limit. Indeed, plugging this particular into (3.23), the left hand side opens into 

four terms. The first of these is the sum of VxVy{{Sx, Sy))j^b, which is part of what we want in 
(3.19). The second and the third terms are of the same form and both amount to 

a^Vx^{xeAK}{i^x,Sy))J,^ = a(J2vx[Sx,-r^ ^ Sy))^^- (3.25) 

By our assumption of a sharp block-average magnetization in uj t,, the average of the spins in 
can be replaced, in the K — oo Umit, by m. Similarly, we claim that 

Jl^^U^ E {iSx,Sy))j^^=\m\\ (3.26) 

x,yeAK 

SO, recalling the definition of a, the left hand side is in a good shape. 

As for the right-hand side of (3.23) with (wx) = {wi^''), here we invoke the fact that (for 

d > 3) 

J™ TTl Yl D-\x,y) = 0, (3.27) 

uniformly in y G Z'^. The claim therefore follows. □ 

Next we show that for any parameters J and b, and any m^, G J, b), we can always find 
a state with magnetization my^ that is a limit of states satisfying Properties 1 and 2. 

Lemma 3.3 For all J > 0, all b & and all G ^^[J, b), there are sequences (Jk), (bk) 
and (rrik) with Jk J, b^ b, ruk m,-^ and .y^{ Jk, bk) = {"rrik}. In particular, there is a 
sequence of infinite-volume Gibbs measures satisfying Properties 1 and 2, which weakly 

converge (possibly along a subsequence) to a measure uj b with magnetization m*. 

Proof. The proof uses a little more of the convexity theory, let us recapitulate the necessary 
background. Let /: M" — >^ (— oo, oo) be a convex and continuous function. Let (•, •) denote the 
inner product in R". For each x G M", let S{x) be the set of all possible limits of the gradients 
Vf{xk) for sequences x^ G such that x^ ^ x as k ^ oo. Then Theorem 25.6 of [51] says 
that the set of all subgradients df{x) of / at x, 

df{x) = {a G M": /(y) - fix) > {y - x,a), y G M"}, (3.28) 

can be written as 

df{x) = Conv(S'(x)) (3.29) 

where Conv(5(x)) is the closed, convex hull of S{x). (Here we noted that since the domain of / 
is all of R", the so called normal cone is empty at all x G M".) But S{x) is closed and thus 
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Conv(5'(x)) is simply the convex hull of ^(a;). Now, by Corollary 18.3.1 of [51], we also know 
that if S C M" is a bounded set of points and C is its convex hull (no closure), then every extreme 
point of C is a point from S. Thus, we conclude: every extreme point of df{x) lies in S{x). 

Now we can apply the above general facts to our situation. Let F( J, h) be the infinite-volume 
free energy of the model in (1.1). Noting that F(J, h) is defined for all J G R and all h G 
Eq, the domain of F is M x Eq. By well known arguments, F is continuous and concave. 
Moreover, a comparison of (1.11) and (3.29) shows that J^(J, 5) is — up to a sign change — 
the subdifferential of F at (J, 6). As a consequence of the previous paragraph, every extreme 
point [eyf,myf] G J^{J,b) is given by a limit limfe^oo[efc, ■m-fc], where [efc,mfc] are such that 
'^*{Jk,bk) = {[efe,mfc]} for some ^ J and hk b. But G .J^^,{J,b) implies that 
[e^, m^] is an extreme point of J(fi,{J, b) for some e^,, so the first part of the claim follows. 

To prove the second part, note that any infinite-volume limit of the finite-volume Gibbs state 
with periodic boundary condition and parameters Jk and bk must necessarily have energy density 
Cfc and magnetization m^. By compactness of the set of all Gibbs states (which is ensured by 
compactness of $7), there is at least one (subsequential) hmit {—)j,b of the torus states as Jk —>■ J 
and bk — > b, which is then a translation-invariant Gibbs state with parameters J and 6 such that 



where x and y is any pair of nearest neighbors of Z*^. However, the block-average values of both 
quantities must be constant almost-surely, because otherwise (— ) j,b could have been decomposed 
into at least two ergodic states with distinct values of energy-density/magnetization pair, which 



We note that the limiting measure is automatically Z'^-translation and rotation invariant and, in 
addition, satisfies the block-average property. But, in the cases that are of specific interest to the 
present work (i.e., when J, 6) contains several elements), there is little hope that such a state 
is a torus state. Nevertheless, we can prove: 

Corollary 3.4 Let J > and b G En. Then for any rrii, G ^*( J, b), there exists a state uj^b 
with (block-average) magnetization m,-^ for which the infrared bound (3.19) holds. Moreover, the 
state vj ij is -translation and rotation invariant. 

Proof. For J = we obviously have a unique Gibbs state and the claim trivially holds. Otherwise, 
all of this follows from the weak convergence of the vj^,bk discussed above. □ 

3.3 Proof of Main Theorem. 

Now we have all the ingredients ready to prove Lemma 1.3: 

Proof of Lemma L3. Fix G ^*( J, b) and let vj^b be the state described in Corollary 3.4. To 
prove our claim, it just remains to choose {vx) as follows: 




(3.30) 



would in turn contradict that [ey^, rrii,] is an extreme point of J, b). 



□ 




if |x| = 1, 
otherwise, 



(3.31) 



and recall the definition of la from (1.13). 



□ 
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Having established Lemma 1.3, we are ready to give the proof of the Key Estimate: 

Proof of Key Estimate. Let J > and 6 G Eq. Let G ^*(^, b) and let {—)j,b be the state 
satisfying (1.15) and (1.17). Our goal is to prove the bound (1.16). To that end, let mo = mo{S) 
denote the spatially averaged magnetization of the neighbors of the origin. The rotation symmetry 
of the state (— ) j,6 then implies 

((5,,So)),,,= ((mo,5o)>,,^,. (3.32) 

Next, conditioning on the spin configuration in the neighborhood of the origin, we use the DLR 
condition for the state (— ) j,b which results in 

((mo, 5o)> = ((mo, VG(Jmo + b)))j^^. (3.33) 

Finally, a simple calculation, which uses the fact that m* = (S'o) j,6 = (mo) j,6 = (VG( Jmo + 
6)) j b, allows us to conclude that 

((mo, VG( Jmo + &))) - 

= ( (mo - m*, VG( Jmo + 6) - VG( Jm* + b))) . (3.34) 

\ / J,b 

To proceed with our estimates, we need to understand the structure of the double gradient of 
function G{h). Recall the notation (— )o,/i for the single-spin state tilted by the external field h. 
Exphcitly, for each measurable function / on Q, we have {f{S))o^h = e~'^^'^\f{S)e^^'^^)o. 
Then the components of the double gradient correspond to the components of the covariance 
matrix of the vector-valued random variable S. In formal vector notation, for any a G Eq, 

(a, VfG{h) = {{a,S- {S)o,hf)o,h- (3-35) 
Pick ho,hi eEci. Then we can write 

{hi-ho,WG{hi)-WG{ho))= ['dx({hi-ho,S-{S)o,h,y) , (3.36) 

where h\ = {\ — X)ho + Xhi. But the inner product on the right-hand side can be bounded using 
the Cauchy-Schwarz inequality, and since 

{\S - {S)oM\\h, ^ S^-^"^^ = ^^-^^^ 

we easily derive that 

{hi - ho, VG{hi) - VG{ho)) < K\hi - hoP- (3.38) 

This estimate in turn shows that the right-hand side of (3.34) can be bounded by KJ(|mo — 
But for this we have the bound from Lemma 1.3: (|mo — Tn*!^) j,b ^ nJ~^Id. Putting 
all the previous arguments together, (1.16) follows. □ 

Proof of Main Theorem. This now follows directly by plugging (1.16) into (1.9). □ 
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By and large, this section is devoted to the specifics of the three models described in Section 2. 
Throughout the entire section, we will assume that 6 = and henceforth omit b from the notation. 
We begin with some elementary observations which will be needed in all three cases of interest 
but which are also of some general applicability. 

4.1 General considerations. 

4.1.1 Uniform closeness to global minima. We start by showing that, for the systems under study, 
the magnetization is uniformly close to a mean-field magnetization. Let ./#mf(<^) denote the set 
of all local minima of #j. Obviously, if we know that the actual magnetization comes close to 
minimizing the mean-field free energy, in must be close to a minimum or a "near-minimum" of 
this function. A useful measure of this closeness is the following: For J G [0, cxd] and i? > 0, 
we let 

Dj{^) = supl dist(m, ^mf{J)) I m e Conv(l^), <?j(m) < Fmf(J) + i?}, (4.1) 

where Fmf{J) denotes the absolute minimum of However, to control the "closeness" we will 
have to make some assumptions about the behavior of the (local) minima of ^j. An important 
property ensuring the desired uniformity in all three models under study is as follows: 

Uniformity Property If J >0 and ifm G Conv(r2) is a global minimum of^j, then there is an 
e > and a continuous function : [ J — e, J + e] — > Conv (J7) such that limj'_> j m" (J') = m 
and (J') is a local minimum of^j' for all J' G [J — e, J + e\. 

In simple terms, the Uniformity Property states that every global minimum can be extended 
into a one-parameter family of local minima. Based on the Uniformity Property, we can state a 
lemma concerning the limit of Dj{'d) as ^9 | 0: 

Lemma 4.1 Suppose that 0j satisfies the above Uniformity Property. Then for all Jq > 0, 

lim sup Dji'd) = 0. (4.2) 
noo<J<Jo 

Proof. This is essentially an undergraduate exercise in compactness. Indeed, if the above fails, 
then for some e > 0, we could produce a sequence i and Jk G [0, Jq] such that 

Dj^{^k)>6e. (4.3) 

This, in turn, implies the existence of G Conv(il) such that 

dist(mjt,^MF(4)) > 3e while ^j^inik) < FupiJk) + i^k- (4.4) 

Let us use J and m to denote the (subsequential) limits of the above sequences. Using the 
continuity of 0j{m), to the right of the while we would have ^j{m) = Fmf{J) and m is thus a 
global minimum of By our hypothesis, for each k sufficiently large, there is a local minimum 
m^{Jk) of with m^{Jk) converging to m as A; ^ oo. Since is also converging to m, 
the sequences and ■m^{Jk) will eventually be arbittary close. But that contradicts the bound 
to the left of the while. □ 
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4.1.2 Monotonicity of mean-field magnetization. For spin systems with an internal symmetry 
(which, arguably, receive an inordinate share of attention), the magnetization usually serves as 
an order parameter. In the context of mean-field theory, what would typically be observed is an 
interval [0, Jmf] where m = is the global minimizer of while for J > Jmf, the function <Pj 
is minimized by a non-zero m. This is the case for all three models under consideration. (It turns 
out that whenever {S)q = 0, the unique global minimum of 0j for J sufficiently small is m = 0.) 

In order to prove the existence of a symmetry-breaking transition, we need to prove that the 
models under considerations have a unique point where the local minimum m = ceases the 
status of a global minimum. This amounts to showing that, once the minimizer of #j has been 
different from zero, it will never jump back to m = 0. In the mean-field theory with interac- 
tion (1.1), this can be proved using the monotonicity of the energy density; an analogous argument 
can be used to achieve the same goal for the corresponding systems on Z"'. 

Lemma 4,2 Let Ji < J2 and let mi be a global minimizer of <Pj^ and m2 a global minimizer 
of^j^. Then \mi\ < \m2\. Moreover, if J t-^ m(J) is a differentiable trajectory of local 
minima, then 

jj^j{rn{J))=~\m{J)\\ (4.5) 

Proof. The identity (4.5) is a simple consequence of the fact that, if m is a local minimum of ^j, 
then V#j(m) = 0. To prove the first part of the claim, let J, J' > and let m be a minimizer 
of Let Fysp{J) be the mean-field free energy. First we claim that 

Fmv{J) - Fmv{J') > -^^-^|m|2. (4.6) 
Indeed, since Fmf{J) = ^j(w), we have from the definition of <Pj that 

FMPiJ) = -^^^|m|2 + #j,(m). (4.7) 

Then the above follows using that ^j/{m) > Fmf(J'). Let Ji < J2 and mi and m2 be as 
stated. Then (4.6) for the choice J = J2, J' = Ji and m = m2 gives 

FMFiJ2) - FmAJi) > _1|^2|2. (4 8) 

J2 — Ji 2 
while (4.6) for the choice J = Ji, J' = J2 and m = mi gives 

Fmf(Ji)-Fmf(J2) (4.9) 
Ji — J2 2 

Combining these two bounds, we have |mi| < |m2| as stated. □ 

4.1.3 One-component mean-field problems. Often enough, the presence of symmetry brings 
along a convenient property that the multicomponent mean-field equation (1.7) can be reduced 
to a one-component problem. Since this holds for all cases under consideration and we certainly 
intend to use this fact, let us spend a few minutes formalizing the situation. 

Suppose that there is a non-zero vector a; G En such that VG(/ia;) is colinear with oj (and 
not-identically zero) for all h. As it turns out, then also VS'(maj) is coUnear with oj, provided 
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moj G Conv(r2). Under these conditions, let us restrict both h and m to scalar multiples of a; 
and introduce the functions 

g{h) = \u\-^G{hu) and s{m) = \u\-^S{mu). (4.10) 

The normalization by |u;|~^ ensures that s{m) is given by the Legendre transform of g{h) via the 
formula (1.4). Moreover, the mean-field free-energy function #j(mu;) equals the |a;p-multiple 
of the function 



4>j{'m) = - 




(4.11) 



The mean-field equation (1.7) in turn reads 

m = g'(Jm). (4.12) 

In this one-dimensional setting, we can easily decide about whether a solution to (4.12) is a local 
minimum of (/>j or not just by looking at the stability of the solutions under iterations of (4.12): 

Lemma 4.3 Let m be a solution to (4.12) and suppose (pj is twice continuously differentiable 
in a neighborhood ofm. If 

Jg"{Jm) < 1 (4.13) 

then m is a local minimum of(f)j. Informally, only "dynamically stable" solutions to the (on-axis) 
mean-field equation can be local minima of(f)j. 

We remark that the term "dynamically stable" stems from the attempt to find solutions to (4.12) 
by running the iterative scheme nik+i = g'{Jmk)- 

Proof. Let h and m be such that g'{h) = m, which is equivalent to h = s'{m). An easy 
calculation then shows that g"{h) = —{s"{m))~^. Suppose now that m is a solution to (4.12) 
such that (4.13) holds. Then h = Jm and from (4.13) we have 

s"{m) = -{g"{Jm)y^ <-J. (4.14) 

But that imphes 

(f)"j{m) = -J -s"{m)> -J + J = {), (4.15) 

and, using the second derivative test, we conclude that m is a local minimum of (/>,/. □ 

With Lemmas 4.1, 4.2 and 4.3 established, our account of the general properties is concluded 
and we can start discussing particular models. What follows in the next three subsections are the 
three respective models laid out in order of increasing difficulty. Our repeated — and not partic- 
ularly elegant — strategy will be to pound at the various models using internal symmetry as the 
mallet. The upshot is inevitably that at most one component becomes dominant while all other 
components act, among themselves, Uke a system at high temperature. Thus all subdominant 
components are equivalent and the full problem has been reduced to an effective scalar model. In 
short, there are some parallels between the various treatments. However, somewhat to our disap- 
pointment, we have not been able to find a unified derivation covering "all models of this sort." 
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4.2 Potts model. 

In order to prove Theorem 2.1, we need to establish (rigorously) a few detailed properties of 
the mean-field free-energy function (2.7). In the view of (2.6) we will interchangeably use the 
notations m and {xi,. . . , Xq) to denote the same value of the magnetization. 

Lemma 4.4 Consider the q-state Potts model with q > S. Let <Pj be the mean-field free-energy 
fimction as defined in (2.7). If m & Conv(Jl) is a local minimum of<Pj then the corresponding 
(xi, . . . , Xq) is a permutation of the probability vector {x\, . . . , x*) such that 

X\>X*2 = ■■■ = X*q. (4.16) 

Moreover, when x\ > X2, we also have 

Jx\>l> Jx^. (4.17) 

A complete proof of the claims in Lemma 4.4 was, to our best knowledge, first provided in [41]. 
(Strictly speaking, in [41] it was only shown that the global minima of <Pj take the above form; 
however, the proof in [41] can be adapted to also accommodate local minima.) We will present 
a nearly identical proof but with a different interpretation of the various steps. The advantage of 
our reinterpretation is that it is easily appUed to the other models of interest in this paper. 

Proof of Lemma 4.4. If m corresponds to the vector {xi, . . . , Xg), we let <P^j\xi, . . . , Xg) be 
the quantity 'Pj{m). Suppose that (.ti, . . . ,Xg) is a local minimum. It is easy to verify that 
{xi,. . . , Xq) cannot lie on the boundary of Conv(O), so Xfc > for all fc = 1, . . . , Pick any 
two coordinates — for simpUcity we assume that our choice is xi and X2 — and let y = 1 — {x^ + 
• • • + Xg), z\ = xi/y and Z2 = X2/y. (Note that y = xi -\- X2 and, in particular, y > 0.) 
Then we have 

0f{xi, ...,xq) = ~Jy^{zf + zi) + y{zi log zi + Z2 log Z2) + Rf{x3, Xq), (4.18) 

where R^j\x3, . . . ,Xq) is independent of zi and Z2. Examining the form of the free energy, we 
find that the first two terms are proportional to the mean-field free-energy function of the Ising 
(q = 2) system with reduced coupling Jy: 

0fix,, ...,xq) = y^fy{zi,Z2) + Rfixs, . . .,Xq). (4.19) 

Since the only z-dependence is in the first term, the pair {zi,Z2) must be a local minimum 
of regardless of what x^,. . . ,Xq look Uke. But this reduces the problem to the Ising model, 

about which much is known and yet more can easily be derived. The properties of ^^j\zi,Z2) 
we will need are: 

(2) 

(i) Jc = 2 is the critical coupling. For J < Jg, the free-energy function <Pj ' (zi , Z2) is lowest 

(2) 

when zi = Z2, while for J > J^, the free-energy function (zi, Z2) is lowest when 
p = \zi — Z2\ is the maximal (non-negative) solution to p = tanh(i Jp). 

(ii) Whenever J > Jc, the maximal solution to p = tanh(i Jp) satisfies J(l — p^) < 2, 
which implies that either Jzi > 1 and Jz2 < 1 or vice versa. 
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(2) 

(iii) For all J and zi > Z2, the mean-field free-energy function {zi,Z2) monotonically 
decreases a& p = z\ — Z2 moves towards the non-negative global minimum. 

All three claims are straightforward to derive, except perhaps (ii), which is established by noting 
that, whenever p > satisfies the (Ising) mean-field equation, we have 

Hence, if J > Jc and zi > Z2, then Jz2 = ^-'^(1 — p) < \J{^ — p^) < 1 and thus Jz\ > 1 
because J{zi + Z2) = J > Jc = 

Based on (i-iii), we can draw the following conclusions for any pair of distinct indices Xj 
and Xk'. If J{xj + Xk) < 2, then xj = Xk, because the (A:,j)-th Ising pair is subcritical, while 
if J{xj + Xk) > 2 then, using our observation (ii), either Jxj. > 1 and Jxj < 1 or vice versa. 
But then we cannot have Jx/- > 1 for more than one index k, because if Jxj. > 1 and Jxj > 1, 
we would have J{xj + xjc) > 2 and the {k,j)-th Ising pair would not be at a local minimum. 
All the other indices must then be equal because the associated two-component Ising systems are 
subcritical. Consequently, only one index from (xi, . . . ,Xq) can take a larger value; the other 
indices are equal. □ 

Proposition 4.5 Consider the q-state Potts model with q > 3. Let <Pj be the mean-field free- 
energy function as defined in (2.7). There there exist Ji and J2 = Q with Ji < J2 such that 

(1) m = is a local minimum of'Pj provided J < J2. 

(2) m = x*Vi -|- • • • -|- x*Vi with x* > X2 = ■ ■ ■ = x* is a local minimum of^j provided 
that J > Ji and x^ = ^ + m, where m is the maximal positive solution to the equation 
(2.8). 

(3) For all J > 0, there are no local minima except as specified in (1) and (2). 

Moreover, if Jmf is as in (2.9), then the unique global minimum of^j is as in (1) for J < Jmf 
while for J > Jmf the function <Pj has q distinct global minimizers as described in(2) . 

Proof of Proposition 4.5. Again, most of the above stated was proved in [41] but without the 
leeway for local minima. (Of course, the formulas (2.8) and (2.9) date to an earUer epoch, see 
e.g. [54].) What is not either easily derivable or already proved in [41] amounts to showing that 
if m is a "dynamically stable" solution to (2.8), the corresponding m = x\v\ + ■ ■ ■ + x*vi as 
described in (2) is a local minimum for the full 0j{m). The rest of this proof is spent proving 
the latter claim. 

We first observe that for the set 

U{x) = (m = {x,X2, ... ,Xq): Jx^ < 1, A; = 2, . . . , g} (4.21) 

the unique (strict) global minimum of # j occurs at 

m(x) = (x,iEf,---,^)- (4.22) 

Indeed, otherwise we could further lower the value of by bringing one of the ( A;)-th Ising 
pairs closer to its equiUbrium, using the properties (ii-iii) above. Now, suppose that m satisfying 
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(2.8) is "dynamically stable" in the sense of Lemma 4.3. By (4.17) we have that the correspond- 
ing X* = i + m satisfies Jxi > 1 while the common value of for A; = 2, . . . , g is such 
that Jx^ < 1. Suppose that the corresponding m is not a local minimum of the full <Pj. Then 
there exists a sequence (m^) tending to m such that ^j{mk) < 0j{m). But then there is also 
a sequence m'^ such that <?j(m^) < ^j(m) where each now takes the form (4.22). This 
contradicts that the restriction of #j to the "diagonal," namely the function ^j(m), has a local 
minimum at m. □ 

Now we are ready to prove our main result about the g-state Potts model. 
Proof of Theorem 2.1. By well known facts from the FK representation of the Potts model, the 
quantities e^{J) and m^{J) arise from the pair [e^, m^] corresponding to the state with constant 
boundary conditions (the wired state). Therefore, [e^,m^] is an extreme point of the convex 
set J(fi,{J) and G ^*(J) for all J. In particular, the bound (1.12) for can be used 
without apology. 

Let 5d be the part of the error bound in (1. 12) which does not depend on J. Explicitly, we have 
= -^{q — because k = {q — l)/q and dim Kq = q — 1. Since /(^ ^ as d — oo, we 

have Sd ^ as d ^ oo. Let us define 

ei = ei{d,J)= sup Dj,{J5d), (4.23) 

0<,/'<J 

where Dj is as in (4.1). It is easy to check that the Uniformity Property holds. Lemma 4.1 then 
guarantees that every (extremal) physical magnetization G ^*(J) has to lie within ei from 
a local minimum ^j. Since the asymmetric minima exist only for J > Ji > while m = 
is a local minimum only for J < J2 = q, we have m,i,{J) < ei for J < Ji, while |m^(J) — 
"^mf(<^)| < ei for J > J2. But from the FKG properties of the random cluster representation 
we know that J h- > mi,{J) is non-decreasing so there must be a point, Jt G (Ji, J2], such that 
(2.10-2.11) hold. 

It remains to show that | Jt — Jmf| tends to zero as d — > 00. For J G (Ji, J2), let ^s{J), 
resp., ^Pa{J) denote the value of <Pj at the symmetric, resp., asymmetric local minima. The 
magnetization corresponding to the asymmetric local minimum exceeds some x > throughout 
(Ji, J2). Integrating (4.5) with respect to J and using that (^sC-Aif) = ¥'a(-'mf) then gives 
us the bound 

l^s(^) - fA{J)\ > \xV - Jmf|. (4.24) 

However, in the ei -neighborhood Ws(ei) of the symmetric minimum, we will have 

\^j{m)-vs{J)\<€iK, (4.25) 

where K is a uniform bound on the derivative of <Pj(m) for m G Ws(ei) and J G (Ji, J2)- 
Since the asymmetric minima are well separated from the boundary of Conv(Q) for J G ( Ji, J2), 
a similar bound holds for the ei -neighborhood of the asymmetric minimum. Comparing (4.24— 
4.25) and (1.12), we find that if 

^xV - -^ifI - 2eii^ > J5d, (4.26) 
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no value of magnetization in the ei -neighborhood of the local minima with a larger value of #j 
is allowed. In particular, | J, — JmfI < £2 where 62 = e2{d) tends to zero as d ^ 00. □ 

4.3 Cubic model. 

Our first goal is to prove Proposition 2.2. We will begin by showing that the local minima of #j 

(r) 

and Kj are in one-to-one correspondence. Let us introduce the notation 

r 

X = [{y,fi): < 1, yj > 0, ^ % = l} (4.27) 
and let X(m) denote the subspace of X where m = yifii -I- ■ ■ ■ -I- yj-Mr- 

Lemma 4.6 Let m G Conv(J7) be a local minimum of^j. Then there exists a {y, p) G X{rn) 

(r) 

which is a local minimum ofKj (as defined in (2.15)j. 

Proof. Let m be a local minimum of <Pj. Since X{m) is compact and Kj is continuous on X, 
the infimum 

#j(m)= inf Kf{y,Ji) (4.28) 

is attained at some (y, p) G X{rn). We claim that this (y, Jl) is a local minimum of K^p. Indeed, 
if the opposite is true, there is a sequence (y^, Jx^) G X converging to (y, Jx) such that 

KP{yk,i2k) < KP{y,i2) = ^j{m). (4.29) 

Now, (y, p,) was an absolute minimum of on X(m), so {yk,Pk) -^(m) and the magneti- 
zation mfc corresponding to {yk,Pk) is different from m for all k. Noting that 

<Pj{mk)<KP{yk,Pk) (4.30) 

and combining (4.29^.30), we thus have 'Pj{mk) < ^j{m) for all k. But rrik tends to m in 
Conv(r2), which contradicts the fact that m is a local minimum of □ 

Lemma 4.6 allows us to analyze the local minima in a bigger, simpler space: 

Lemma 4.7 Let [y, p) be the quantity in (2.15). Then each local minimum of KP{y, p) 
is an index-permutation of a state (y, p) with yi > y2 = ■ ■ ■ = Ur cind 1^2 = ■ ■ ■ = fJ^ = 0- 
Moreover, ifyi > y2, then jii ^ 0. 

Proof. Let (y, p) be a local minimum of such that yi > y2 > ■ ■ ■ > Vr and fix a A; between 1 
and r. We abbreviate y = yu + Uk+i and introduce the variables zi = yk/y, Z2 = yk+i/y, 
vi = fik and 1/2 = fik+i- Then 

K^iy, p) = yKf^iz, v) + i?, (4.31) 

where (z, v) is the mean-field free energy of an r = 2 cubic model with coupUng con- 
stant Jy, and i? is a quantity independent of (z, v). As was mentioned previously, the r = 2 
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cubic model is equivalent to two decoupled Ising models. Thus, 

Kf^{z,i?) = ejyip,) + Ojy{p2), (4.32) 
where pi and p2 are related to zi, Z2, i^i and 1/2 via the equations 

Zi = i(l+pi/J2), ^;ii^l = 5(P1+P2), 

Z2 = 2y^-PlP2), Z2V2 = 2\P'^~PV- 

Now, the local minima of 0j{p) occur at p = ±p(J), where p(J) is the largest non-negative 
solution to the equation p = tanh(^ Jp). Moreover, by the properties (i-iii) from the proof of 
Lemma 4.4 we know that p(J) = for J < 2 while ^ J(l - p(J)^) < 1 once J > 2. From 
these observations we learn that if yk = yfc+i, then Jy < 2 and pk = l^k+i = 0- On the other 
hand, if yk > yt+i, then Jy > 2, y^. = iy(l + p{Jy)'^) and y^+i = ^{1 - p{Jyf) so, in 
particular, Jy^ > 1 > Jyk+i- However, that forces that k = 1, because otherwise we would also 
have Jyk-i > 1 and J{yk-i + yk) > 2, implying that (y, jl) is not a local minimum of K^^ in 
the (A; — 1, A;)-th sector. Hence, y2 = ■ ■ ■ = yr and /X2 = • ■ ■ = /x^ = 0, while if yi > y2, then 
pi = ±p{J)/zi 7^ 0. □ 

The proof of Lemma 4.7 gives us the following useful observation: 

Corollary 4.8 Let m = {mi, m2 , • • • , rrij. ) be contained in Conv and suppose that mi , m2 7^ 
0. Then one of the four vectors 

(mi ±m2, 0,777.3, ... ,771^), (0,77i2 ±mi,m3, . . . ,mr) (4.34) 

corresponds to a magnetization m' G Conv(J7) with 0j{m') < #j(m). 

Proof. Since m is in the interior of Conv(O), there exists {y, p) where the infimum (4.28) is 
achieved. Let zi, Z2, I'l and V2 be related to yi, 2/2, Pi and p2 as in (4.31-4.33). Now by 
(4.32) the free energy of the corresponding sector of (y, p) equals the sum of the free energies 
of two decoupled Ising models with biases pi and p2. Without loss of generality, suppose that 
Pi > P2 > 0. Recalling the property (iii) from the proof of Lemma 4.4, p ^ &j{p) decreases 
when p > gets closer to the non-negative local minimum. Thus, if pi is nearer to the local 
minimum of Ojy than p2, by increasing p2 we lower the free energy by a non-trivial amount. 
Similarly, if p2 is the one that is closer, we decrease pi . 

By inspection of (4.33), the former operation produces a new quadruple z'l, z'2, v'^ and v'^, with 
= and z'lv']^ = pi. But that corresponds to the magnetization vector [m'l^m^, ma, ... ,mr), 
where 

^'1 = Piy — ^1 + ^2 and m2 = 0, (4.35) 
which is what we stated above. The other situations are handled analogously. □ 

Now we are finally ready to establish the claim about local/global minima of (pj: 
Proof of Proposition 2.2. By Lemma 4.6, every local minimum of corresponds to a local 

(r) 

minmium oiKy. Thus, using Lemma 4.7 we know that all local minima m of <Pj will have 
at most one non-zero component. Writing a; = (1, 0, . . . , 0), h = huj and m = ma;, we can 
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use the formalism from Section 4.1. In particular, the on-axis moment generating function g{h) 
is given by 

g{h) = - log(2r) + log(r - 1 + cosh h). (4.36) 

Differentiating this expression, (4.12) shows that every local minimum m has to satisfy the equa- 
tion (2.16). Now, for r > 2, a little work shows that h i— > g'{h) is convex for 

(r - 1)^ - (r - 1) cosh h + 2>0 (4.37) 

and concave otherwise. In particular, for r > 3, the equation (2.16) has either one non-negative 
solution m = or three non-negative solutions, m = 0, m = m_{J) and m = m_|_(J), where 
< m-{J) < mj^{J). However, m+(J) is "dynamically stable" and, using Lemma 4.3, m-{J) 
never corresponds to a local minimum. 

To finish the proof we need to show that m = (m_|_( J), 0, .... 0) is a local minimum of 
the full ^j. If the contrary were true, we would have a sequence tending to m such 
that ^j(mfc) < ^j(m). Then an (r — l)-fold use of Corollary 4.8 combined with the sym- 
metry of implies the existence of a sequence = (m^, 0, .... 0) tending to m and sat- 
isfying ^j{m'^) < <^>j{mk) for all k. But that contradicts that m+(J) is a local minimum of 
the on-axis mean-field free energy function. So m was a local minimum of <Pj after all. The 
existence of a unique mean-field transition point Jmf is a consequence of Lemma 4.2 and the fact 
that m = ceases to be a local minimum for J > r. □ 

Proof of Theorem 2.3. The proof is basically identical to that of Theorem 2.1, so we will be 
rather sketchy. First we note that m^{J) is achieved at some extremal translation-invariant state 
whose magnetization m,^ is an element of ^^.{J). Let 5^. = \rld and define ei as in (4.23). 
Then has to be within ei from a local minimum of #j. While this time we cannot proclaim 
that J ^ m^{J) is non-decreasing, all the benefits of monotonicity can be achieved by using 
the monotonicity of the energy density e^(J). Indeed, J ^-k{J) is non-decreasing and, by 
Corollary 1.2 and the Key Estimate, we have 



< ^rh = JSd- (4.38) 



But then e*(J) must undergo a unique large jump at some Jt from values e*(J) < 2JSd to 
values near ^m,MF{J)'^ by less than 2J5d- So m*( J) has to jump at J = Jt as well, in order to 
obey (4.38). The width of the "transition region" is controlled exactly as in the case of the Potts 
model. □ 



4.4 Nematic model. 

The nematic models present us with the difficulty that an expUcit formula for <Pj(m) seems 
impossible to derive. However, the situation improves in the dual Legendre variables. Indeed, ex- 
amining (1.4-1.6), it is seen that the stationary points of c^j(m) are in one-to-one correspondence 
with the stationary points of the (Gibbs) free-energy function 

^Jih) = ^,\h\^-G{h), (4.39) 
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via the relation h = Jm. (In the case at hand, h takes values in which was defined as the 
space of all N X N traceless matrices.) Moreover, if m = VG{h), then we have 

^j{h) - ^j{m) = ^\h- Jmp (4.40) 

so the values iP'j (m) and {h) at the corresponding stationary points are the same. Furthermore, 
some juggling with Legendre transforms shows that if m is a local minimum of then h = Jm 
is a local minimum of ^j. Similarly for local maxima and saddle points of ^j. 

Lemma 4.9 Each stationary point of ^j{h) on is a traceless N x N matrix h with eigen- 
values that can be reordered to the form hi > h2 = • ■ ■ = Hn- 

Proof. The claim is trivial for A'^ = 2 so let A'^ > 3. Without loss of generality, we can restrict 
ourselves to diagonal, traceless matrices h. Let h = diag(/ii, . . . , Hn) be such that J2a ~ 
and let Va, with a = 1, . . . , AT, be the components a unit vector in M^. Let (— )o be the 
expectation with respect to the a priori measure on J7 and let (— )/i be the state on tilted by h. 
Explicitly, we have 

{f)h = e-^e^) / f^idv)fiv) e^p{j2havl} (4.41) 

a=l 

for any measurable function / on the unit sphere in M^. 

As in the case of the Potts and cubic models, the proof will be reduced to the two-component 
problem. Let /i be a stationary point of *j and let a and (3 be two distinct indices between 1 
and N. The relevant properties of {—)h are then as follows: 

(i) If J{vi + vph > 3, then ^ h^. 

(ii) If ha > hp, then J{vi)h > § > J{vph- 

The proof of these facts involves a non-trivial adventure with modified Bessel functions, In{x), 
where n is any non-negative integer and In{x) = ^ d6 ^ cos{n6) . To keep the com- 
putations succinct, we introduce the polar coordinates, Va = r cos and vp = r sin 9, where 
9 G [0, 27r) and r > 0. Let (— )a./3 denote the expectation with respect to the r-marginal of the 
state where h' = dmg{h[, . . . , h'j^) is related to h via h'^ = ^/j = ^(^a + hp), while 

h'^ = hj for 7 7^ a,/3. ExpHcitly, if f{r,9) corresponds to f{va,vp) via the above change of 
coordinates, then 




(4.42) 



where A = ^{ha — hp). 

We begin by deriving several identities involving modified Bessel functions. First, a straight- 
forward calculation shows that 



{vl-vl)H = Aap{A){r'h{r'A)) 



(4.43) 
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where A^piA) ^ = {Io{r^A))af3. Similarly we get 

{vlvj)f, = A^A) (yiloir^A) - h{r^A))) 



{AAA) 



But /o(a;) — l2{x) = {2/x)Ii{x), whereby we have the identity 



(4.45) 



A similar calculation using trigonometric formulas shows that 



{vi)H = A„p{A) {r\llo{r^A) + \h{r'A) + \h{r^l^))^^ 
{vt)H = A^p{A) {r\llo{r'A) - iJilr^A) + iHr'A))) 



(4.46) 



(4.47) 



In particular, since Iq (0) = 1 while Ii (0) = I2 (0) = 0, we have 



(4.48) 



The identities (4.44^.48) will now allow us to prove (i-ii). 

First we note that he fact that h was a stationary point of implies that — = J{v^ — 
vii)h for all 7, 7' = 1, . . . , A'^. Using this in (4.45), we have the following dichotomy 



To establish (i), suppose that J{v'^+v'^)h > 3 but ha = hp. Then (4.48) gives us 2J{vav'^)h > 1> 
in contradiction with (4.49). Hence, (i) must hold. To prove (ii), assume that ha > hp and note 
that then A > 0. Applying that h{x) > and hix) > for x > in (4.46), we easily show 
using (4.46) that {Va)h > 'i{vaVp)h- Similarly, the bound h{x) > hix) for a; > 0, applied in 
(4.47), shows that < 3(f ^t'Dh- From here (ii) follows by invoking (4.49). 

Now we are ready to prove the desired claim. Let h. be a stationary point. First let us prove 
that there are no three components of h such that ha > hp > h^. Indeed, if that would be the 
case, (i-ii) leads to a contradiction, because /i„ > hp would require that J{vg)fi < 3/2 while 
hp > hj would stipulate that J{v'^)h > 3/2! Thus, any stationary point h of can only have 
two values for {Va)h- However, if (say) both {v\)h and take on the larger value (implying 
that hi = h2), then J{vf + V2)h > 3 and h caimot be a stationary point. From here the claim 
follows. □ 

The symmetry of the problem at hand allows us to restrict ourselves to the on-axis formalism 
from Section 4.1. In particular, we let u) = diag(l, —jf^, ■ ■ ■ , ~]\rrx)' ^ ~ ^^'^ ^ ~ 
and define the functions g{h), s{X) and ^j(A) as in (4.10^.11). Lemma 4.9 in turn guarantees 
that all local minimizers of <Pj appear within the domain of ^j. What remains to be proved is the 
converse. This can be done using some of the items established above. 

Lemma 4.10 Suppose that A is a stationary point of the scalar free energy (f)j which satisfies 
Jg"{J\) < 1. Then A = Aa;, with uj = diag(l, —j^, ■ ■ ■ , — t^z^), is a local minimizer of^j. 

Proof. To simplify the exposition, we will exploit the 0(iV)-symmetry of the problem: If g G 
0{N, M) is any N x N orthogonal matrix, then 



either ha = hp or 2J{v'^v'^)h = 1. 



(4.49) 



#j(m) =^j{g ^mg), 



(4.50) 
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with similar considerations applying to ^j{h). Thus, for all intents and purposes, we may assume 
that the arguments of these functions are already in the diagonal form and regard the diagonal as 
an iV-component vector. (Indeed, we will transfer back and forth between the vector and matrix 
language without further ado.) 

Again we are forced to work with the dual variables. To that end, let V'j(^) be the quantity 
\u}\~^W'j{hu}). Clearly, the relation between and (f>j is as for and <Pj. First, let us demon- 
strate that every stationary point of the scalar free energy ipj represents a stationary point of the 
full ^j. Indeed, let K be the orthogonal complement of vector u) in M^. As a simple computation 
shows, any k eK has a zero first component. If fc = (0, A;2, . . . , k^) £ IK is small, then 



G{hu} + k) = G{hu}) + 



13 



hu) 



(4.51) 



where {—)h is as in (4.41). Now {vp)h^ is the same for all /3 



2,. . . ,N, and in the view 
of the fact that = 0, the expectation vanishes. Hence, V^j{hoj) has all components 

corresponding to the subspace IK equal to zero. Now if /i is a stationary point of we know 
that {u, V^j{hu)) = and thus Vl^j{huj) = as claimed. 

To prove the desired claim, it now suffices to show that the Hessian of is positive definite 
at h = h*u} when h* satisfies Jg"{h*) < 1. (Recall that the corresponding stationary points 
of i/jj and (pj are related hy h = JA.) This in turn amounts to showing that VVG(/ia?) is 
dominated by the J^^-multiple of the unit matrix. Although we must confine ourselves to En, it 
is convenient to consider the Hessian of G{h) in a larger space which contains the constant vector 
and restrict our directional probes to vectors from Eq. In general, the entries of the Hessian are 
given in terms of truncated correlation functions: 



(Hess(G')) = {vlv^n - {vl)h{v})h- 



y(3 

For the problem at hand, there are only four distinct entries 



(4.52) 



/ A B 
B C 



Hess(G) 



D 



D 



B D 



B \ 
D 



C D 

D c y 



(4.53) 



Clearly, oj itself is an eigenvector of Hess(G) with the eigenvalue A — B. On the other hand, if 
fe G K, then the first row and column of Hess (G) are irrelevant. Writing the remaining (iV — 
1) X (A^ — 1) block in the form (C — D)1 + C S, where S is the matrix with all entries equal to 
one, it follows easily that all of K is an eigenspace of Hess(G) with eigenvalue C — D. 

It remains to show that these eigenvalues are strictly smaller than J~^. The first one, namely, 
A — B is less than by our assumption that Jg"{h*) < 1. As to the other eigenvalue, C — D, 
we note that 

'i)h-{vlv})n, a>(3>l. (4.54) 



D 
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Now, equation (4.48) tells us that, under our conditions, {v'^Vp)h equals ^ {v%)h- So we need that 
\{v%)h is less than J. But since hi = h* > ha, that is exactly the condition (ii) derived in the 
proof of Lemma 4.9. □ 

Now we are ready to establish our claims concerning the local minima of <Pj: 
Proof of Proposition 2.4. Let u> be as above and note that |a;|^ = N/ {N — 1). Then the on-axis 
moment generating function from (4.10) becomes 

gih) = log J TTNidv) e'^^(''i-^), (4.55) 



where ttat is the uniform probabihty measure on the unit sphere in and vi is the first compo- 
nent of v. An argument involving the A?^-dimensional spherical coordinates then shows that 

7rN{viedx) = C{N){l-x^)^dx, (4.56) 

where C{N) is the ratio of the surfaces of the unit spheres in M^~^ and M^. By substituting this 
into (4.55) and applying (4.12), we easily find that, in order for A = Xuj to be a local minimum 
of the scalar A has to satisfy the equation (2.24). 

A simple analysis of (2.24) shows that for J <C 1, the only solution to (2.24) is A = 0, while 
for J > N"^, the solution A = is no longer perturbatively stable. Since Lemma 4.2 guarantees 
that the norm of all global minimizers increases with J, there must be a unique Jmf £ (0, oo) 
and a non-decreasing function J i-^ Xmf{J) such that Xmf{J) solves (2.24) and that every global 
minimizer of cpj at any J > Jmf which is a continuity point of J i-^ Amf(</) corresponds to 
A = Amf(«^)- (At any possible point of discontinuity of J i— > Xmp^J), the A corresponding to 
any global minimizer is sandwiched between lim Amf(«^') and limj/^j Amf(«^')-) The claim 
is thus proved. □ 

In order to prove the large- part of our statements concerning the mean-field theory of the 
nematic model, we will need to establish the following scaling property: 

Lemma 4.11 Let <P^j^^ denote the free-energy function of the 0{N)-nematic Hamiltonian. In- 
troduce the matrix uj = diag(l, — jy^, • • • , —77^) define the normalized mean-field free- 



energy function 

(A) = l|c^|-2^(^)(Ac.), A<L (4.57) 



Then, as N ^ 00, the function A <—>■ (f)[^\x) converges, along with all of its derivatives, to the 
function 



^^r\^) = -i^' + l^og^^. (4.58) 



Proof. The proof is a straightforward application of Laplace's method to the measure on the 
right-hand side of (2.24). Indeed, for any h>0, consider the measure p/j^jv on [0, 1] defined by 



PhM^^) = J! .."^^ dx. (4.59) 

£ dx{l - x^) — e^^^ 
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Noting that the function x i-^ (1 — x^) 2 e'*^^ has a unique maximum at a; = Xh, where 

4 = max{0,l-^}, (4.60) 

we easily conclude that 

lim PhM-) =^xh{-), (4.61) 

AT— >oo 

where 5a{-) denotes the Dirac point mass at x = a. Here the limit taken in the sense of weak 
convergence on the space of all bounded continuous functions on [0,1]. The proof of this amounts 
to standard estimates for the Laplace method; we leave the details to the reader. 

Let gN{h) denote the function g{hN) where g is as in (4.55). Since any derivative of gN{h) 
can be expressed as a truncated correlation function of measure ph,N^ we easily conclude that 
gnih) converges, along with all of its derivatives, to the function 

g^ih) = lim gNih) = maxjo, h -]--)- log(2/i)), (4.62) 

for all h > 0. Now, the function sn(X) = -^\uj\~'^S{Xu}) — where S{-) is the entropy of the 
0(iV)-nematic model — ^is the Legendre transform of ^^v, so we also get 

Soo(A) = lim sn{\) = log (4.63) 

N^oo Z i — A 

(Again, the convergence extends to all derivatives, provided A < 1.) From here the claim follows 
by noting that 0'y^-'(A) = — -^A^ — Siv(A), which tends to (l)^j°\x) in the desired sense. □ 

Proof of Proposition 2.5. By Lemma 4.11, the scaled mean-field free-energy function is, 

along with any finite number of its derivatives, uniformly close to on compact subsets of 

[0, 1), provided N is sufficiently large. Now the local minima of will again satisfy a mean- 
field equation, this time involving the function g^o from (4.62). Since 

{1 L if /) > i 

0, otherwise, 

there are at most two perturbatively stable solutions to the mean-field equation: One at A = and 
the other at 

A = i (1 + Vl - 4J-2) . (4.65) 

Moreover, these local minima interchange the role of the global minimum at some finite and 
non-zero , which is a solution of a particular transcendental equation. For J near , the 

second derivative of j^'* is uniformly positive around both local minima. 

The convergence stated in Lemma 4.1 1 ensures that all of the previously listed facts will be (at 
least qualitatively) satisfied by for large as well. Thus, ^ has at most one positive local 
minimum, which immediately implies that J ^ '^mf^(<^) is continuous whenever it is defined. 
Moreover, since the local minima of (j)"^'^ converge to those of (/>j°°\ we also easily recover the 
asymptotic statements (2.26-2.27). This finishes the proof. □ 
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Proof of Theorem 2.6. The proof is similar to that of the Potts and cubic models; the only extra 
impediment is that now we cannot take for granted that there is only one non-zero local minimum. 

As before, most of the difficulties will be resolved by invoking the monotonicity of the energy 
density e*(J), which is defined e.g. by optimizing \{{Qq, Qx))j over all Gibbs states invariant 
under the lattice translations and rotations. 

In the present case, n and n in the Main Theorem are given by k = (A^ — 1)/N and n = 
^N{N — 1). Thus, letting S^t = j{N — 1)^1^, the quantity JS^ is the corresponding error term on 
tiie right-hand side of (1.12). Define ei by the formula (4.23). Then Lemma 4.9 guarantees that 
the diagonal form A of {Qq)j for any Gibbs state is an index permutation of a vector of the type 

[^ + o,i,-j^^^ + a2,... ,-j;^^^ + aNj, (4.66) 

where Oj = 0, a? < ef and A corresponds to a local minimum of 'Pj. If A is the physical 
magnetization giving rise to A*(J), we let AJ^p(J) be a value of A, corresponding to a local 
minimum of #j, for which A takes the form (4.66). Then Corollary 1 .2 and the Key Estimate give 

- ^]v^^mf(^)'| < 2J<^d. (4.67) 

Now for J < Jo ^ 1, we know the only local minimum is for Aj^p( J) = 0, while for J > Ji > 
iV^, the zero vector is no longer a local minimum and hence AJ^p( J) exceeds some x' > 0. But 
J H- > e*(J) is non-decreasing so there must be a Jt G [Jq, Ji] where e*(J) jumps by at least 
x' — 2 Jt^d, which is positive once d is sufficiently large. The fact that Jt must be close to Jmf 
for large enough d is proved exactly as for the Potts and cubic models. □ 



5. Mean-field theory and complete- graph models 

Here we will show that the mean-field formalism developed in Section 1.2 has a very natural 
interpretation for the model on a complete graph. An important reason for the complete graph 
picture is to provide a tangible physical system to motivate some of the physical arguments. The 
forthcoming derivation is a rather standard exercise in large-deviation theory [16, 19], so we will 
keep it rather brief. 

We will begin by a precise definition of the problem. Let t/A? be a complete graph on N vertices 
and consider a spin system on ^jv vvith single-spin space J7 and the Hamiltonian 

7 ^ 
PHMiS) = -- iS.,Sy)-J2ib,Sx). (5.1) 

l<x<y<N x=l 

(Recall that is a compact subset of a finite-dimensional vector space Kq with inner product 
denoted as in the previous formula.) Let /x denote the a priori spin measure and let (— )o denote 
the corresponding expectation. For each configuration S, introduce the empirical magnetization 
by the formula 

1 ^ 

x=l 
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If m G Conv(ri) and e > 0, let (m) denote the e-neighborhood of m in Conv(ri) in the metric 
induced by the inner product on Eq. Then we have: 

Theorem 5.1 For each m G Conv(J7), 

where ^j^b(m) is as defined in Section 1.2. Moreover, ifv^ denotes the Gibbs measure obtained 
by normalizing e~^^^^^^ and if Fysi{J, b) denotes the infimum of^j^b{m) over m G Conv(O), 
then 

lirn^ iyN{^j,b{^N{S)) > Fmp{J, 6) + e) = (5.4) 

for every e > 0. 

Proof By our assumption, En is a finite-dimensional vector space. Moreover, Q, is compact 
and thus the logarithmic generating function G{h) defined in (1.3) exists for all h G Eq. As a 
consequence of Cramer's Theorem for i.i.d. random variables on R", see Theorem 2.2.30 in [16], 
the measures 

I^Ni-) = l^{mN{S) e ■) (5.5) 
satisfy a large-deviation principle on with rate function (1.4). In particular, 

lim^im log {Ue{m)) = S{m), m G Conv(O). (5.6) 

Now can be written as follows 

N 



(3Hn = NEj^b{mN{S)) - ^ ^(^^, S^). (5.7) 

x=l 



Since the second term is bounded by a non-random constant almost surely and since m ^ 
Ej^ij(m) is uniformly continuous throughout Conv(O), (5.3) follows by inspecting the definition 
of#j,6(m). □ 
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